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We present a new drift condition which imphes rates of conver- 
gence to the stationary distribution of the iterates of a i/)-irreducible 
aperiodic and positive recurrent transition kernel. This condition, ex- 
tending a condition introduced by Jarner and Roberts [Ann. Appl. 
Probab. 12 (2002) 224-247] for polynomial convergence rates, turns 
out to be very convenient to prove subgeometric rates of convergence. 
Several applications are presented including nonlinear autoregressive 
models, stochastic unit root models and multidimensional random 
walk Hastings-Metropolis algorithms. 

1. Introduction. Let (<I>n>'^ > 0) be a discrete time Markov chain on a 
general measurable state space (X, ;S(X)) with transition kernel P. Assume 
that it is ■0-irreducible, aperiodic and positive recurrent. This paper con- 
siders the use of drift conditions to establish the convergence in /-norm 
of the iterates of the kernel to the stationary distribution tt at rate 
r := (r(?i), n > 0); that is, 

(1.1) lim r(n)||P"(x,-)-vr||f = 0, vr a.e. 

where /:X — > [l,oo) satisfies 7r(/) < oo and for any signed measure //, the 
/-norm is defined as supj^j^j |M5)|- 

For geometric rate functions, that is, functions r that satisfy 

. logr(n) logr(n) 
< lim ini < lim sup < oo, 

n n 

it is known that (1.1) holds if and only if the Foster-Lyapunov drift condition 
holds that is, there exist an extended real- valued function F:X— [l,oo] 



Received January 2003; revised June 2003. 
AMS 2000 subject classification. 60J10. 

Key words and phrases. Markov chains, stationary distribution, rate of convergence. 

This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Applied Probability, 
2004, Vol. 14, No. 3, 1353-1377. This reprint differs from the original in 
pagination and typographic detail. 



1 



2 



R. DOUC, G. FORT, E. MOULINES AND P. SOULIER 



finite at some € X, a petite set C, A G (0, 1), 6 > and c > such that 
c-^f<V<cf and 

(1.2) PV <XV + blc. 

In that case, the convergence (1.1) holds for all x in the set {V < oo] which 
is of vr measure 1. See, for instance, Meyn and Tweedie (1993) (hereafter, 
MT), Theorem 16.0.1. 

For rates of convergence slower than geometric, no such definitive result 
exists. An important family of such rates is the class of subgeometric rate 
functions, defined in Nummelin and Tuominen (1983) as follows. Let Aq 
be the set of positive nondecreasing functions tq such that ro(0) > 1 and 
log{ro(n)}/n decreases to 0. The class of subgeometric rate functions is the 
set A of positive functions r such that there exists a sequence ro G Aq that 
satisfies 

r(n) r(n) 

(1.3) 0<limmf <limsup < oo. 

ro(n) ro(n) 

This class includes, for example, polynomial rate functions, that is, rate 
functions r such that (1.3) holds with ro(n) := (1 + n)^ for some /3 > 0. 
It also includes rate functions which increase faster than polynomially, for 
example, rate functions r satisfying (1.3) with 

ro(n) := { 1 + log(n)}"(n + l)^e^'^' 
^'^ for a,/3GM, 7G(0, 1) and oO. 

We will refer to these rates as subexponential in order to distinguish them 
in the broad class of subgeometric rates. 

Tuominen and Tweedie (1994) [see also Nummelin and Tuominen (1983)] 
gave a set of equivalent conditions that imply the convergence (1.1) with a 
subgeometric rate function r G A. To state this result, we first recall some 
notation and definitions. 

A measurable set C is V'a-petite (or petite) if there exist a distribution 
a := (a(n), n > 0), a constant e > and a nontrivial measure ipa on B{X) such 
that, for allxeC, B£ B{X), 

Kaix, B) := a{n)P''{x, B) > MB). 

n>0 

The return time to a measurable set A is defined as ta' = infjn > l,<I>n £ 
A] (with the convention inf0 = +00). Let he a maximal irreducibility 
measure and let B~^(X) be the class of accessible sets, that is, sets B £ B(X) 
such that V(-B) > 0. A set A G ^(X) is called full if ^(A^) = 0, absorbing if 
P{x, A) = I for all x G A and, for a measurable positive function / and a 
rate function r, A is said to be (/, r)-regular if, for every B G B^{X), 

-TB-l 

supE^ J2 r(k)f(<^k) < 00. 
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A (1, l)-regular set is simply said to be regular. A finite positive measure 
A on B{X) is said to be (/, r)-regular if ^x[J2k=o^ '^^^^ fi^k)] < oo for all sets 
B G B^{X). If, for some 2; € X, the Dirak measure 5x is (/, r)-regular, then 
the point x is said to be (/, r)-regular. The set of all (/, r)-regular points is 
denoted by S(f,r). 

We can now recall (part of) Theorem 2.1 of Tuominen and Tweedie (1994). 



Theorem 1.1 [Tuominen and Tweedie (1994)]. Assume that P is tp- 
irreducihle and aperiodic. Let f .X ^ [l,oo] be a measurable function, and 
let r be given. The following conditions are equivalent. 

(i) There exists a petite set C € ;B(X) such that 



sup E^. 



TC-l 

E <k)f{^k) 

. k=0 



< 00. 



(ii) There exist a sequence of extended real valued functions {Vn,n > 0), 
T^i : X — > [1, 00], a petite set C G B{X) and a constant b < 00 such that Vq is 
bounded on C, 

Vo{x) = +00 =^ Vi(x) = +00 

and 

(1.5) PVn+i+r{n)f<Vn + br{n)lc. 

(iii) There exists an {f ,r)-regular set A G B'^iX). 
Any of these conditions implies that, for all x G S{f,r), 

r(n)||P'^(x,-)-vr(0||/=O, n ^ 00, 

and the set S{f,r) is full, absorbing and contains the set {Vq < 00}. More- 
over, for all {f,r)-regular initial distributions X, n, there exists a constant c 
such that 



00 ^ ^ 

Er(n) J J A(dx)Mdy)||^"(x,-)-^"(?/,' 

n=0 



<c{X{Vo) + fx{Vo)). 

This theorem cannot be improved since it provides a necessary and suf- 
ficient condition, but the sequence of drift conditions (1.5) is notoriously 
difficult to check in practice and one has very little insight on the way to 
choose the family of drift function (Vn,n > 0). This is why these drift con- 
ditions, to the best of our knowledge, have seldom been used directly. 
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A first step toward finding a more practical drift condition was taken 
by Jarner and Roberts (2002), wlio, simplifying and generalizing an argu- 
ment in Fort and Moulines (2000), have shown that if there exist a function 
V [I, oo] finite at some xq £ X, positive constants b and c, a petite set 
C and a S [0, 1) such that 

PV + 6^" <V + blc, 

then the chain is positive recurrent and, for each /? G [1, 1/(1 — a)], the con- 
vergence (1.1) holds for all j; € {1^ < oo} which is of vr measure 1, with 
r(re) := n^~^ and / := it is noteworthy that there is a balance be- 

tween the rate of convergence and the norm: the larger the latter, the slower 
the former. In particular, the fastest rate of convergence [r{n) = n"/^'""-*] 
corresponds to the total variation norm, and the slowest rate (r = 1) corre- 
sponds to the V°' norm. 

In this paper, we consider the following drift condition which generalizes 
the Foster-Lyapunov and the Jarner-Roberts drift conditions. 

Condition D(0, V, C). There exist a function y : X ^ [1, oo], a concave 
monotone nondecreasing differentiable function 0:[l,oo] i-^-CO, oo], a mea- 
surable set C and a finite constant h such that 

PV + <i)oV <V + b\c. 

Here (j) is assumed differentiable for convenience. It can be relaxed since a 
concave function has nonincreasing left and right derivatives everywhere. If 
Ti{4>,V,C) holds for some petite set C and there exists xq € X such that 
V{xq) < oo, then the /-norm ergodic theorem (see MT, Theorem 14.0.1) 
states that there exists a unique invariant distribution vr, 7r(0 oV) < oo and 

Um||P"(x, O-vrlUoy = 0, 

n 

for all X in the set of vr-measure 1 < oo}. If, in addition, tt{V) < oo, then 
there exists a finite constant B such that, for all x € < oo}, 

oo 

^ ||P"(x, •) - ttW^oV < B{\ + V{x)). 

The ((?l)o y)-norm is the maximal norm for which convergence can be proved 
under Condition 'D{(j),V,C) and in that case, the rate of convergence is 
minimal: r = 1. This implies that, for any function I < f < (poV , convergence 
in the /-norm also holds. In order to determine the rate of convergence in 
the /-norm by means of Theorem 1.1, we should try to find a sequence of 
functions (V^,n > 0) such that (1.5) holds, but this is precisely what we are 
trying to avoid doing for all functions /. Instead, having in mind the balance 
between the rate of convergence and the norm, we will first determine the 
rate of convergence in the total variation norm by using the criterion (1.5) 
and then interpolate intermediate rates of convergence in the /-norm. 
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Thus, this new drift condition not only generahzes former results, but also 
yields a very straightforward way of proving subgeometric rates of conver- 
gence, in particular subexponential rates of the form (1.4). The interpolation 
technique is a key tool to obtain these rates easily and in practice yields all 
usual rates. 

The rest of the paper is organized as follows. Our main result. Theo- 
rem 2.8, is stated and proved in the next section. Several typical functions 
<j) are then considered, leading to a variety of subgeometric rate functions. 
The flexibility of the interpolation technique is illustrated by exhibiting new 
pairs of rates and controls functions. Several applications are given in Sec- 
tion 3. We establish subgeometric rates of convergence (in particular, faster 
than polynomial rates) in several models: first-order nonlinear autoregres- 
sive models, stochastic unit root models and the random walk multidimen- 
sional Hastings-Metropolis algorithm, under conditions which do not imply 
geometric ergodicity. These examples should illustrate the efficiency of Con- 
dition Ti{(f),V ,C) and the fact that it is indeed easier to check than the 
sequence of drift conditions (1.5). 



2. Main result. 



2.1. Rate of convergence in the total variation norm. Let (^:[l,oo)— > 
(0, oo) be a concave nondecreasing differentiable function. Define 

(2.1) H^{v):= ^ 

Then i?^ is a nondecreasing concave differentiable function on [1, oo). More- 
over, since (j) is concave, (j)' is nonincreasing. Hence (p{v) < + (j)'{l){v — 1) 
for all > 1 , which implies that increases to infinity. We can thus define 
its inverse H^^ :[0,oo) — >• [l,oo), which is also an increasing and differen- 
tiable function, with derivative (if^')'(x) = (j)oH^^{x). For A; G N, z > and 
V >l, define 

r^(z):=iH^'y{z) = (l)oH^\z), 

(2.2) Hkiv) := j^^"^"^ r^{z + k)dz = H^' (H^v) + k) - H^\k\ 

Vu := Hk o V. 

We will show that, provided Condition J){(j),C ,V) holds with C petite and 
sup3,g(^F(x) < oo, the chain (O^, /c > 0) is (1, r^)-regular; that is, r^ is the 
rate of convergence in total variation norm that can be deduced from the 
drift condition. To this end, we will use Theorem l.l(ii); that is, we will 
show that (1.5) holds with (V^, fc > 0), / := 1 and r := r^. 
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Proposition 2.1. Assume 'D((j),V,C). Then is log concave and, for 
all k>0, Hk is concave and 

Proof. Note first that r'^{z) / r (f,{z) = cp' o H^^(z) for all z>0. Since (j)' 

is nonincreasing and H^^ is increasing, <f>' oH^^ is nonincreasing and logCr,^) 
is concave. This implies that, for any fixed k > 0, the function z r^{z + 
k)/r^{z) is a decreasing function. The derivative of has the following 
expression: 

H't,{v) = r^{H^{v) + k)/m 

(2.3) 

= r^{H^{v) + k)/r^{H^{v)). 

Since i/^ is increasing, it follows from the discussion above that H'^ is non- 
increasing; hence is concave for all A; > 0. Applying (2.3) and the fact 
that r^f, is increasing, we obtain 

Hk+i(v) - Hkiv) 

{r^(z + k + \) - r^(z + k)} dz 



/ / r'Jz + k + s) ds dz 

Jo Jo 

: / {r^{H^{v) + k + s)- r^{k + s)} ds 
Jo 



<r^{H^iv) + k + l) -r^ik) 

= cP(v)Hi^,iv) - r^k). 

We have thus shown the following inequality which is the key tool of the 
proof: 

(2.4) Hk+i(v) - (k{v)H'k+x{v) < Hkiv) - r^k). 

Let g he a concave differentiable function on [l,oo). Since g' is decreasing, 
for all f > 1 and x G M such that v + x> 1 , it holds that 

(2.5) g{v + x)<g{v) + g'{v)x. 

Applying this property to the concave function Hk+i, we obtain for all A; > 0, 
xe {V < oo}, 

PVk+i{x) < Hk+i{V{x) - o V{x) + h\c{x)} 

< Hk+i{V{x)) - 4> o V{x)H'k+,{V{x)) + hH'k+,{V{x))\c{x) 

< Hk+,(V(x)) - cP o V{x)HL,(Vix)) + bHL,{l)lcix). 
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Applying (2.3) and (2.4), we obtain that -ff^+iCl) = r^{k + l)/r^(0) and 

This inequahty still holds for a; € { ^ = oo } , which concludes the proof. □ 

The drift condition Y)[(f),V ,C) and Proposition 2.1 imply the following 
bounds for the modulated moments of the return time rc, by application 
of Proposition 11.3.2 in MT. 

Proposition 2.2. Assume B((f>,V,C). Then, for all x G X, 



TC-l 

L k=Q 



rc - 1 
. fc=0 



< V{x) + blcix), 



<V(x)+'-^lc(x). 



In order to apply Theorem 1.1, we must also check the following conditions: 

(i) the rate sequence r^:={<j)o H^^{k), k>0) belongs to A, 

(ii) the drift function V is bounded on C. 

The next lemma gives a simple criterion to check that € A. 
Lemma 2.3. // lim^^oo (p'it) = 0; ^^en G A. 

Proof. We have already noted that r^(x)/r<^(x) = (f)' oH^^{x) for all x > 
0. Let r be any differentiable function such that r(0) = 1 and lim^^oo r'{x)/r(x)= 0. 
Then, applying Cesaro's lemma, we obtain, 

log(r(n)) 1 f^r'is) 



n 



n Jo r{s) 



■ds^O. 



If, moreover, r'/r decreases, then log(r(x))/x also decreases. Thus € A. 

□ 



If the condition sup^^Q V{x) < cx) is not satisfied and if the set C is petite, 
the drift condition 'D{(p,V,C) can be slightly modified so that it holds with 
a new set C on which V is bounded. The following lemma, adapted from 
Theorem 14.2.3 of MT, states this formally. 

Lemma 2.4. Assume that 'D((p,V,C) holds for some petite set C and 
that lim^i^oo <p{v) = oo. Then, for all M > I, the sublevel sets {x € X, V{x) < 
M} are petite. In addition, for any (3, < f3 < I, there exists a sublevel set 
Cp such that Ti((3(j),V ,Cp) holds. 
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Proof. Since (j) is positive nondecreasing and V >l, Condition D((^, V, C) 
implies the drift condition PV <V - (/>(1) + hlc- Theorem 11.3.11 of MT 
shows that, for all accessible sets B € S'^(X), there exists a constant c{B) < 
oo such that, for all x G X, we have 4'{1)Kx[tb] < V{x) + c(B). Hence, every 
set A G B(X) such that sup^.g^y(x) < oo is (1, l)-regular, and the sublevel 
sets are all (1, l)-regular. Proposition 11.3.8 of MT shows that if a set A is 
(1, l)-regular, then it is petite. Hence, all the sublevel sets are petite. 

Since Um^,^oo </'(^^) = oo for all /? G (0, 1), there exists such that v > Mp 
implies (piv) > 6/(1 - For x ^ C/3 := {V < M^}, we thus have 6 < (1 - 
P)(j)iVix)) and 

PV + (3(j){V) <V + {(3- l)(P{V) + blc < V. 
For X gC^, since (3 £ (0, 1), it trivially holds that 

PV + p(l)iV) <V + b. □ 

We are now in position to establish the rate of convergence in total vari- 
ation distance. 

Proposition 2.5. Let P be a ip -irreducible and aperiodic kernel. As- 
sume that 'D((j),V,C) holds for a function (j) such that lim^^oo ^^'(i) = 0, a 
petite set C and a function V such that {V < 00] ^ 0. Then, there exists 
an invariant probability measure vr, and for all x in the full and absorbing 
set {V < 00}, 

limr^(n)||P"(a;,-)-vr(0||Ty =0. 

Any probability measure A such that \{V) < 00 is {\,r(f)) -regular and for two 
(I, r^) -regular distributions A, /i, there exists a constant c such that 

Y^r^in) J J X{dx)n{dy)\\P''{x,-) - P''{y,-)\\TV 

n=0 

<c{X(V) + i^iV)). 

Remark 1 . Since (j)' is nonincreasing, if we do not assume that limt,_+oo 4''iv) = 
0, then there exists c G (0, 1) such that lim^-^oo 4'' (v) = c > 0. This yields 
V — (j){v) < (1 — c)v + c — (/'(I). In this case, Condition D(0, V, C) implies the 
Foster-Lyapunov drift condition, and the chain is geometrically ergodic. 

Proof of Proposition 2.5. The only statement which requires a proof 
is the fact that any probability measure such that XiV) < cxd is (1, r^)-regular. 
This assertion is established in Proposition 3.1(ii) of Tuominen and Tweedie 
(1994), and relies on Lemma 3.1 of Nummelin and Tuominen (1983). We 
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nevertheless propose a proof that shortens the previous one. The proof is 
adapted from the proof of Theorem 14.2.3 of MT. Proposition 2.1 shows 
that there exist a sequence of drift functions (14, A: > 0) and a constant b 
such that Vq <V and 

PVk+i <Vk- r^{k) + h<j){\)-'r^{k + l)lc. 

Dynkin's formula shows that, for all accessible set B, 



TB-l 

. fc=0 



< Vo{x) + 6(/)(l)"iE^ 



■rs-l 
. fc=0 



+ l)lc(<J>A:) 



From Propositions 5.5.5 and 5.5.6 of MT, we can assume without loss of gen- 
erality that C is V'a-petite, where V'a is equivalent to tp, and that the sampling 
distribution a has finite mean := X^j^i i'^j < By the Comparison The- 
orem (MT, Theorem 14.2.2), the bound \c{x) < 'ilJa{B)~'^Ka{x,B) and the 
fact that is nondecreasing, we have 



TB-l 

. fc=0 



< Vo(a;) + 6</)(l)-^E^ 



TB-l 
. A:=0 



< Vq{x) + 6</.(l)- Va(5)~' J2 



i>0 



+ l)lB(<J>fc+i) 



Tfl-l 

. fc=0 

< Vq{x) + 6</.(l)- Va(S)"'maEJr<^(rB)]. 
For A: > 1, define R^{k) := E^t^ r^ij)- 

Since is subgeometric, it holds that 
limk^oc r^(k)/R(j,(k) = 0. Hence, for any 5>0, there exists a constant c((5) 
such that, for all A; > 1, r(^(fc) < 5R^{k) + c{5). This yields 

^AR^{tb)] < Voix) + 6,/.(l)- Va(5)"^n,(5E,[i?^(rB)] + c{6)). 

Thus for small enough 6, we obtain 

VJ,(x) + 6ma^-kW<5)0(l)-' 



(2.6) 



K^R^iTB)] < 



l-b6mai^a\B)(j){iy 



□ 
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2.2. Rate of convergence in f -norms. As already mentioned in the poly- 
nomial case and discussed in Tuominen and Tweedie (1994), in the subgeo- 
metric case there is a compromise between the rate of convergence and the 
control function. In what follows, we will show that it is possible at almost no 
cost to obtain many intermediate rates of convergence and control functions. 
Let y be the set of pairs of ultimately nondecreasing functions *Fi and ^2 
defined on [1,cxd) such that lim^^-joo^I'iC^;) = 00 or lim^^-joo ^I'aC^;) = 00 and, 
for all X, y € [1, 00), 



The set 3^ includes, for example, ^i(,x) = x and ^2{x) = 1, but there are of 
course more interesting examples. For example, it is well known that, for 
any x,y>0, and p and q such that 1/p + 1/g = 1, we have 

xy < x^/p + y'^/q. 

Hence, the pair of functions = p^^Px^^'^ , ^2{x) = c^l'^x^l'^ satisfies (2.7). 

These are precisely the interpolating functions used in Jarner and Roberts 
(2002) to derive polynomial rates of convergence. Young functions pro- 
vide many useful interpolating functions. We recall their definition. Let 
Q\ : (0, 00) (0, 00) be an increasing left-continuous function such that Qi(0) = 
and lim^,_^4.oo Qiiv) = +00. Let g2 be the left-continuous inverse of ^1 , which 
is increasing and satisfies also Q2i0) = and lim^^^oo Qiiv) = +00. Define 
then Gi{x) := Jq gi(t)dt, i = 1,2. The well-known Young inequality states 
that, for all x,y>0, we have 



Let be the inverse of Gj, i = 1,2. Then *Fi and ^2 are concave functions 
and it follows immediately from (2.8) that the pair (*Fi,*F2) satisfies (2.7). 

We use this full scale of interpolating functions in combination with 
Proposition 2.2 to derive bounds for the modulated moment of return time 
to the set C. More precisely, we have the following. 

Proposition 2.6. Assume 'D((j),V,C) and let ^¥1,^2) Then 



(2.7) 



'i'y{x)^2iy)<x + y. 



(2.8) 



xy<Gi{x) + G2{y). 



'TC-l 



L A:=0 



< 2V{x) + b{\ + r^i\)/r40))lcix). 



We need a criterion for a rate function *Fi o r^p to be subgeometric. Note 
that if the pair ^¥1,^2) belongs to y, then, for large enough x, it holds that 
'¥i(x)<2x, i = \,2. 
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Lemma 2.7. Assume that limt_>oo <P'{t) = 0. For any nondecreasing func- 
tion *F such that < ax for some constant a, *F o € A. 

The next theorem summarizes all our previous results. 

Theorem 2.8. Let P he a tp -irreducible and aperiodic kernel. Assume 
that T)((j),V ,C ) holds for a function (p such that limt^oo </''(i) = and a 
petite set C such that sup^;; < cxo. Let (*Fi,*F2) G 3^. Then, there exists an 
invariant probability measure tt, and for all x in the full set {V <oo}, 

lim»Fi (r^(n)) •) - ^:{■)h,^^oV) = 0- 

Any probability measure A such that \(y) < oo is (*F2((/)o y),*Pi(r(^)) -regular, 
and for two such distributions A,/i, there exists a constant c such that 

E (^^C")) / / KdxMdy)\\P''ix, •) - ■)h,(^oV) 

n=0 

<ciX(V) + fi(V)). 



Proof. From Proposition 2.6 we have 



sup Ej; 



»Fi(r^(fc))»P2(</'oy(Ofc)) 

. fc=0 



< oo. 



Theorem 1.1 shows that O is {^2{(p ° V),^i{r(j)))-iegulai. As in the proof 
of Proposition 2.5, and using again the comparison theorem, for any set 
B € ;B"^(X), there exist constants ci{B) and C2{B) such that 



E, 



TB-l 
L k=0 



+ E, 



E 

L k=0 



Hence, for any {^1,^2) G 3^, we have 



Ex 



TB-i 
. k=0 



<ci{B)Vix) + C2{B). 



<ciiB)V{x) + C2iB), 



which shows that any probability measure such that X{V) < cxo is (^2(4' 
*Fi(r^))-regular. □ 

2.3. Some usual rate functions. In this section, we provide examples of 
rates of convergence obtained by Theorem 2.8 for several functions (j). In Sec- 
tion 3, we will provide examples and explicitly determine the drift function 
V and the set C. For two sequences Un and Vn, we write Un^Vn, if there 
exist positive constants ci and C2 such that, for large n, ciUn <Vn< C2n.«. 

We assume throughout this section that Condition Ti[(j),V ,C) holds for 
some petite set C and sup^ V <(Xi. 
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Polynomial rates of convergence. Polynomial rates of convergence have 
been widely studied recently under various conditions [see Veretennikov 
(1997, 2000), Tanikawa (2001), Jarner and Roberts (2002) and Fort and Moulines 
(2003)]. As already mentioned, polynomial rates of convergence are associ- 
ated to the functions (j){v) := cv" for some a G [0, 1) and c G (0, 1] and the rate 
of convergence in total variation distance is r^(n) oc n"/^'~"\ Set ^i{x) := 
((1 - and »F2(x) := (px)P for some p, < p < \. Applying Theo- 

rem 2.8 yields, for x G {1/ < oo}, 

(2.9) limn(i-P)"/('-'^)||P"(x,-)-vr(-)||y<:«P =0. 

n 

This convergence remains valid for p = 0,\ by Proposition 2.2. Set k := 
1 + (1 - p)a/{l - a) so that 1 < k < 1/(1 - a). With this notation, (2.9) 
reads 

limn''-' II-P"(x, •) - 7r(-)||yi-K(i-c) = 0, 

n 

which is Theorem 3.6 of Jarner and Roberts (2002). 

It is possible to extend this result by using more general interpolation 
functions. We can, for example, obtain nonpolynomial rates of convergence 
with control functions which are not simply power of the drift functions. 
To illustrate this point, set for 6 > 0, *Fi(x) := (1 V log(x))* and ^2ix) '■= 
x{\ V log(x))-''. It is not difficult to check that we have 

sup {x + y)~^'^i{xYi>2{y)<oo, 

(x,j/)G[l,oo)x[l,oo) 

SO that, for all x G {y < oo}, we have 

(2.10) limlog''(n)||P"(x, •) - ^(•)||y.(i+iog(i/))-i> = 0, 

(2.11) limn°Ai-°)log-V)||^'"(3;,-) -vr(0||(i+iog(y))^ =0, 
and for all < p < 1 , 

limn(i-P)"/(i-"Uog%||P"(x, •) - vr(-)||y.p(i+iogy)_. = 0. 

Logarithmic rates of convergence. We now consider drift conditions which 
imply rates of convergence slower than any polynomial. Such rates are ob- 
tained when Condition Y){(f),V ,C) holds with a function (f) that increases 
to infinity slower than polynomially. We only consider here the case (j){v) = 
c(l + log(t;))" for some a > and c G (0, 1]. A straightforward calculation 
shows that 

r0(?i) xlog"(n). 

Proposition 2.5 shows that the chain is (l,log"(n)) and ((1 +logF)", 1)- 
regular. Applying Theorem 2.8, intermediate rates can be obtained along 
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the same lines as above. Choosing, for instance, ^i{x) :=((! — p)x) and 
^lix) '.= {px)P for <p < 1, the chain is ((1 +log V)^",log(n)^'~P^")-regular 
and thus, for ah x G < oo}, 

lim (1 +log(n))^'-P^"||P"(x,-) -vr(-)|I(i+iog(V))p" =0. 

n— >oo 

Subexponential rates of convergence. Subexponential rates [as defined 
in (1.4)] have been considered only recently in the literature. An example 
(in continuous time) has been studied by Malyshkin (2001); discrete-time 
examples are considered in the recent work by Klokov and Veretennikov 
(2002). These rates, which increase to infinity faster than polynomially, are 
obtained when Condition 'D{(j),V,C) holds with (j) such that, v/(p{v) goes to 
infinity slower than polynomially. More precisely, assume that (p is concave 
and differentiable on [1,+cxd) and that for large v, (j){v) = cv/log°'{v) for 
some a> and c > 0. A simple calculation yields 

r^(?i)xn-°/<^+"^exp({c(l +a)n} '/(!+")), 

and thus the chain is (1, n-"/<i+") exp({c(l +a)n}i/(i+"^)) and {V/{1 +Iog T/)", 1)- 
regular. Applying Theorem 2.8 with := x'~^(l Vlog(x))~^ and ^2ix) '■ = 

xP{\ V log(x))^ for p G (0, 1) and beM.; p = and 6 > 0; or p = 1 and b < -a 
yields, for all x G {F < oo}, 

Iimn-(°+^)/(i+")exp((l -p){c(l + a)n} '/<'+">) 

(2.12) 

X ||P"(x, •) - vr(-)||yp(i+iogy)6 = 0. 

Asymptotically, the term 7^-("+'')/(i+") ig not very important and we can 
express (2.12) in a simpler way: for all d<(\ — p){c{l + a)}^^^^^"\ we have, 
for the same values of p and b, 

lime-^^'^'^^'llP^C^, •) - vr(.)||yp(i+iogy). = 0. 

3. Applications. We now illustrate our findings by applying Theorem 
2.8 to several models. In Section 3.1, we exhibit a simple example where the 
rates obtained in Theorem 2.8 can be proved optimal. In the next sections 
we study several examples where no such optimality results are available. 

In this section, (•,•) and | ■ | denote, respectively, the scalar product and 
the Euclidean norm in any Euclidean space. The transpose of a vector v is 
denoted v' .li u is a twice continuously differentiable real valued function on 
M.'^, Vu (resp. V^u) denotes its gradient (resp. its Hessian matrix). 
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3.1. Backward recurrence time chain. The backward recurrence time 
chain (see MT, Section 3.3.1) is a rich source of simple examples of stable 
and unstable behavior. We consider it here to provide examples of chains sat- 
isfying Condition D((/), y, C) and for which the rates of convergence implied 
by it are optimal. 

Let (pn, n > 0) be a sequence of positive real numbers such that pQ = 
1, Pn & (0, 1) for all n > 1 and lim„_>oo 117=1 — ^- Consider the backward 
recurrence time chain <I> with transition kernel P defined as P{n,n + 1) = 
1 — P(n, 0) =Pn, for all n>0. Then is irreducible and strongly aperiodic 
and {0} is an atom. Let tq be the return time to {0}. We have, for all n > 1, 

n— 1 n— 1 

IPo(ro = n + l) = {l-pn)Y[pj and Po(ro >n)=Y[pj. 

j=0 j=0 

By Kac's theorem (MT, Theorem 10.2.2), since O is ^-irreducible and ape- 
riodic, <I> is positive recurrent if and only if Eo[ro] < oo, that is, 



J2 n^'i <°°' 

n=l j=l 

and the stationary distribution vr is given, by -7r(0) 
for j > 2, 



7r(l) = 1/Eo[to] and 



7r(i) 



"o(ro > i) 



P0---Pj-2 



lEo [to] IE() [to] 1 Pi • ■ • Pn 

Because the distribution of the return time to the atom {0} has such a simple 
expression in terms of the transition probability {pn,n > 0), we are able to 
exhibit the largest possible rate function r such that the (1, r)-modulated 
moment of the return time ^oV^^^Zo ^i^)] is finite. We will also prove that 
the drift condition Yi{(j),V ,C) holds for appropriately chosen functions V 
and (j) and yields the optimal rate of convergence. Note also that, for any 
function /, it holds that 



■ro-l 



k=0 



■■En 



■ro-l 



A:=0 



Therefore there is no loss of generality to consider only (1, r)-modulated 
moments of the return time to zero. 

If sup,„>jp„ < A < 1, then, for all A < /i < 1, EqL^"^"] < oo and {0} is thus 
a geometrically ergodic atom (MT, Theorem 15.1.5). Subgeometric rates of 
convergence in total variation norm are obtained when Iimsupp.„ = 1. De- 
pending on the rate at which pn approaches 1, different behaviors can be 
obtained, covering essentially the three typical rates (polynomial, logarith- 
mic and subexponential) discussed above. 
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Polynomial rates. Assume first that, for 6 > and large n, pn = I — 
(1 + e)n-\ Then Ui=iPi n'^'^. Thus, EoEfcTo' ^(k)] < oo if and only if 
X^fc^i r{k)k~^~^ < oo. For instance, r{n) := with < (5 <9 is suitable. 

Logarithmic rates. If for 9 >Q and large n,pn= 1 — 1 /n — (1 + 0)/(nlog(n)), 
then 0^=1 Pj ^ log~^~^(^); which is a summable series. Hence if r is non- 
decreasing and J2'k^=i ^(^) 11^=1 Pi < oo, then r{k) = o(log^(/c)). In particular, 
r{k) := log'^CA;) is suitable for ah < /? < 6*. 

Subgeometric rates. If for large n, p„ = 1 — 6/3n^~^ for some > and 
/3 G (0,1), then U7=iPi ^ e-''"^ Thus, EqEIU^ e"^'""] < if a < ^, and 
EoElL"o e"'""] = oo if a > 0. 

Checking Condition D ((j),V,C ). In order to prove that Proposition 2.5 
provides the optimal rates of convergence, we now compute in each of the 
previous examples the rates of convergence it yields. 

For the polynomial and subexponential cases, the same technique can be 
used. For 7 G (0, 1) and x G N*, define ^(0) := 1 and V(x) := nj=o Pj"' ■ Then, 
for all X > 0, we have, 

PV{x) = p^Vix + 1) + (1 - PxWiO) 

= pi-^V(x) + l-p, 

<V{x)-{i-plr^)V{x) + i-p,. 

Thus, for < 5 < 1 — 7 and large enough x, it holds that 
(3.1) PV{x) < V{x) - 5(1 - PxWix). 

1. Case pn = I - {I + 0)n~\ e > 0. Then ^/(x) x andil -p^)V{x)>i 
X/(a;)i-i/(7(i+e)_ rpj^yg Condition 0(0, y,C) holds with <j)(v) = cv°' for 
a = 1 — 1/(7(1 + 6)) for any 7 G (0, 1). Theorem 2.8 yields the rate of 
convergence 7i<^/(i~°' = n'^^^~^^^~^ , that is, for any < P < 9. 

2. Case pn = l — 9f3n^~'^ . Then, for large enough x, (3.1) yields 

PV{x) < Vix) - 9p6x^-^V{x) 

< cV{x){log{V{x))}^~^/'^, 

for c < 9^/f^ 135. Defining a := 1//? — 1, Proposition 2.1 yields the follow- 
ing rate of convergence in total variation norm: 

n-"/(^+")exp({c(l + a)n}>/('+">) =n^-iexp(05^n^). 

Since 5 is arbitrarily close to 1, we recover the fact that ^q\Y1^^Zq e"'^'^] < 
00 for any a <9. 
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Ix-l , 

for e > arbitrarily small. Then, for constants c < c' < c" < I and large 
X, we obtain 

log^Cx + 1) 



3. Casepn = I -n'^ -il + 0)n~^\og-\n), 6* > 0. Choose Vix) := (UjZo Pj) / ^og 



= V(x)-c"e^^ + l-p. 
X log(3;) 

<V{x)-c'elog^-'{x) 

<V{x) - celog'^~'{V{x)). 

Here again Theorem 2.8 yields the optimal rate of convergence. 

3.2. Symmetric random walk Hastings-Metropolis algorithm. We con- 
sider the symmetric random walk Hastings-Metropolis algorithm. The pur- 
pose of this algorithm is to simulate from a probability distribution vr which 
is known only up to a scale factor. At each iteration, a move is proposed 
according to a random walk whose increment distribution has a symmet- 
ric density q with respect to the Lebesgue measure fj,d on W^. The move is 
accepted with probability a{x, y) defined by 



(3.2) a{x,y):-- 




if -k{x) > 0, 
if 7r{x) = 0. 

The transition kernel of the Metropolis algorithm is then given by 

P{x,A)= / a{x,x + y)q{y)dfid{y) 
J A 

+ iA{x) j {I- a{x, X + y))q{y) d^idiy)- 

Mengersen and Tweedie (1996) have shown that a real valued Metropolis 
chain is geometrically ergodic when the proposal density q satisfies moment 
conditions and the target density tt is continuous, positive and log concave in 
the tails. This condition is necessary in the sense that if the chain is geomet- 
rically ergodic, then / e-xp{s\z\)'K{z) d^d{z) < oo for some s > 0. These results 
have been extended to the multidimensional case by Roberts and Tweedie 
(1996) and Jarner and Hansen (2000). Polynomial ergodicity was proved by 
Fort and Moulines (2000) for a target density with regularly varying tails. 
We now state conditions that imply subexponential rates of convergence. 



Assumption 3.1. The target density vr is continuous and positive on 
and there exist m E (0, 1), r G (0, 1), positive constants di,Di,i = 0,\,2 and 
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Rf) < oo such that, if |a;| > Rq, x tt{x) is twice continuously differentiable 
and 

/ Vtt(x) X \ 

(^•'' {wM-m)--''- 

(3.4) do\xr <-\og'Kix)<Do\xr, 

(3.5) dilx^-^ < |Vlog7r(x)j < Di\xr-\ 

(3.6) djlx^-^ < iV^logvrCx)! < Djlxp-^ 

The Weibull distribution on M with density 7r{x) := f3^x'^~^ exp(—(3x'^), 
for a; > 0, /3 > and < 7 < 1 satisfies Assumption 3.1. Multidimensional 
examples are provided in Fort and Moulines (2000). For the sake of simplic- 
ity, we make the following assumption on the proposal density q. 

Assumption 3.2. The proposal density q is symmetric and bounded 
away from zero in a neighborhood of zero and is compactly supported; that 
is, there exists c{q) such that, for all |y| > c(q), q{y) = 0. 

Theorem 3.1. Under Assumptions 3.1 and 3.2, there exist z>0, c> 0, 
r >0 suchthat'D((j),V,C) holds withV := tt-'^ and (j){v) := OT(l + log?;)-2(i-™)/' 
and C := { \x\ < r] . 

Under Assumptions 3.1 and 3.2, Theorem 2.2 of Roberts and Tweedie 
(1996) shows that the chain is ■0-irreducible and aperiodic and nonempty 
bounded sets of B~^{M.'^) are petite. Thus, we obtain the following corollary. 

Corollary 3.2. There exist c > and z > such that, any probability 
measure X on M.^ satisfying X{V) < 00 is {f,r)-regular with r{n) = e'^"™'''^ 
and f = TT~^ . 

Remark 2. Our result complements the work of Fort and Moulines 
(2000) who show that under Assumptions 3.1 and 3.2, the chain is (/, r)- 
ergodic with f{x) := (1 + \x\^) and r{n) := (1 + n)'^, for any fi> and > 0. 

Remark 3. The compactness Assumption 3.2 can probably be relaxed 
and replaced by an appropriate moment condition. 

Remark 4. We do not provide explicit values of the constants c and 
z here; these values can be deduced explicitly from the proof. It should 
be stressed that optimal values of these constants are related: the larger 
c, the smaller z and vice versa. The same comments apply to Corollar- 
ies 3.4 and 3.6. 
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Proof of Theorem 3.1. Define TZ{x) := [y eR'^, Tr{x + y)< t^{x) } the 
potential rejection region. Using the definition of the transition kernel P, we 
have 

PV{x) - V{x) 

{y{x + y)-V{x))q{y)d^id{y) 

+ / {V{x + y)- Vix)) (^^^^^±A _ i\ q^y) d^i^iy). 
Jnix) V vr(x) / 

Set l{x) := -logvrCx), R{V,x,y) := V{x + y) - V{x) + zV{x){\Il{x),y) and 
R{TT,x,y) := 7r(x + y)/7T{x) — 1 + (Wl{x),y). It is proved in Lemma B.4 of 
Fort and Moulines (2000) that there exists a constant c such that, for large \x\. 

(3.7) sup \R{-K,x,y)\\y\-^ <c\x\^^'^-^\ 

\y\<c(q) 

Using a Taylor expansion with integral remainder term of the function x ^ 
V{x), it is easily shown that there exists a constant c such that, for all 
z € (0, zq) and large \x\, 

(3.8) sup \R{V,x,y)\\y\~^- <cz^V{x)\x\^^'^-^\ 
\y\<c(q) 

Since q is symmetric, we have 
PV{x) - V{x) 

= -zV{x) [ (V/(x), yfqiy) d^idiy) 
Jn(x) 

+ [ R(y,x,y)q(y)dnd(y) 

RiV,x,y){Vlix),y)qiy)dfidiy) 

n(x) 



+ zV{x) I {Vl{x),y)RiiT,x,y)q{y)dnd{y) 

+ / R(V,x,y)R(7r,x,y)q{y)dnd(y)- 
JTUx) 



ni(x) 

Thus, for large |x|, we deduce from (3.7) and (3.8) that 
PV{x) - V{x) 



V{x) 



<-z [ {Vl{x),yfq(y)dtidiy) + cz^\x\^^"'-'^ 
Jn(x) 
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for some positive constant c that does not depend on z. It is shown in Lemma 
B.3 of Fort and Mouhnes (2000) that there exists ij > such that, for large 
\x\, 

(3.9) / {Vlix),yfqiy)dfidiy)>vmxt>v4\x\^^"'-'\ 

JTlix) 

Hence, upon noting that zdolxl"' < \ogV{x), there exists a constant k which 
is positive for z small enough, such that, for large \x\, 

PV{x) - V{x) < -K[logy(x)]-^^^-"*)/"' V{x). 

Since vr is bounded on compact sets, sup|2,|<j,jPy(x) + V{x) < oo and the 
proof is concluded. □ 



3.3. Nonlinear autoregressive model. Consider a process (^n^n ^ 0) that 
satisfies the following nonlinear autoregressive equation of order 1 : 

(3.10) ^n+\ = gi^n) + en+l, 

where the sequence {en,n < 0) and the function g satisfy the following as- 
sumptions. 



Assumption 3.3. {En,n > 0) is a sequence of i.i.d. zero mean, d-dimen- 
sional random vectors, independent of <I>o, that satisfy 



(3.11) 



E 



< OO, 



for some zq>Q and 70 G (0, 1] and the distribution of eo has a nontrivial 
absolutely continuous component which is bounded away from zero in a 
neighborhood of the origin. 



Assumption 3.4. g:W^ is continuous, and there exist r, i?o > 

and p G [0, 2) such that 

(3.12) \g{x)\<\x\{\-r\x\~P) if|x|>i?o- 



There already exists a wide literature on conditions implying a geomet- 
ric rate of convergence for nonlinear autoregressive models [see, e.g., Duflo 
(1997) and Grunwald, Hyndman, Tedesco and Tweedie (2000) and the ref- 
erences therein] . Conditions implying a polynomial rate of convergence have 
been obtained by Tuominen and Tweedie (1994) and AngoNze (1994) and 
have been refined by Veretennikov (1997, 2000), AngoNze (2000) and Fort and Moulines 
(2003). Conditions implying a truly subexponential rate of convergence are 
considered in Klokov and Veretennikov (2002) [see also Malyshkin (2001) 
for diffusion processes]. 
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Theorem 3.3. Assume that Assumptions 3.3 and 3.4 hold. 

(i) If p > 7o, the drift condition D((j),V,C) holds with (j){v) := cv{l + 
log(v))'-^/(To^^2-p))^ y(^) g^|xroA(2-P) (7 := G M<^, |x| < M] for some 

z G (0, zq), c> and M > Rq. 

(ii) // p < 70, t/ien i/ie Foster-Lyapunov condition (1-2) holds with C = 
{x € M"^, |x| < M} for some M > Rq and V{x) = e^'^l^" with z = zq if p <-Jq 
and z € (0, zq) if p = ^q. 

Corollary 3.4. Assume in addition that, for all x G M"^, \g{x)\ < \x\. 
Then the chain is ip -irreducible and aperiodic and compact sets of B~^{W^) 
are petite. 

If P > iQ, then there exists c > and z E (0, zq) such that any proba- 
bility measure A on satisfying \{V) < 00 is {f,r)-regular with r{n) = 

^enl^0A(2-P,./p ^(^)^g.|.roA(2-P)_ 

Proof of Theorem 3.3. Throughout the proof, c is a generic constant 
that can change upon each appearance. Applying the inequahty V{u + w)< 
V{u)V{w), we obtain that in all cases, PV is bounded on compact sets of 
W^. Thus the proof consists in bounding PV — V outside balls. 

(i) We start by examining the case p > 70. Set /? = 70 A (2 — p). We write 

^ ■ ' V{x) V{x) V{x) 

Using the inequality (1 — u)'^" < 1 —^qu for all < u < 1, we have for |x| > Rq, 
\g{x)\^ < \x\^ — (3r\x\^~'', and since — 1 < x + for all x < 0, 

(3.14) ^<^' 

< -zrl3\xf~P + -z2r2/32|2;|2(/5-p). 

Let <r] < 1. We establish that, for small z, large |x| and large I^Cx)!, 

(3.15) PV{x) - V{g{x)) < ^z^P^E[\eo\^V{eQ)]\x\^'^-^V{x). 

Set R{u, w) = V(u + w)- V(u) - {VV(u), w). Since E[eo] = 0, this yields 
PVix)-Vigix)) = E[V{g{x) + eo)]-Vigix)) 
= E[R{gix),eo)], 

and we have to upper bound the remainder term K[R{g{x), eo)]. If \w\ < ri\u\, 
then by using a Taylor expansion with integral remainder term, one has 



\R(u,w)\< [ (l -t)\w'\/^V{u + tw)w\dt 
Jo 



<\\w\^zP sup {{I + zP\u + twf)\u + twf~^V{u + tw)}. 

i6[0,l] 
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Since y ^ |yp^~^e^l^l'^ and y ^ |y|^~^e^l^l'' are ultimately nondecreasing, 
for large \u\ and \w\ < rj\u\, we have 

\R{u,w)\ < ^\w\^zp(\ + zP{\u\ + + \w\f-^V(,u)V(,w) 

(3 17) 

< ^z^P^\w\^V{w)\u\^'^~^V{u) + c\w\^V{w)\uf-^V{u). 
If > ?7|ti|, using again the inequality V{u + < V{u)V{w), 
\R(u, w)\ < V{u + w) + V{u) + \VV{u)\ \w\ 

(3.18) <c\w\V{w)\uf-W{u) 

< c\w\^V{w)\uf-^V{u). 

We now apply (3-17) and (3.18) with u = g{x) and w = Eq. Since y 
\^y\2p-2 ^z\y\f y ^ \^y\^f3~2 ^z\y\f ultimately nondecreasing, for large 
|(7(x)|, we have 

(3 19) 

<\z'(3^\£o\^V{£o)\x\^f^-^V{x) + c\eo\^V{eQ)\x\f^-^V{x). 

Equation (3.15) now follows from (3.19). Gathering (3.14) and (3.15), as 
f3 <2 — p, we obtain that we can choose z < zq, Mi and M2 such that, for 
1 3; I > Ml and \g{x)\ > M2, it holds that 

PV{x) - V{x) < (j){V{x)) 

with (t){v) = -KPzP/'^{l+log{v)}^~p/>^v and 

{r, if /3 < 2 — /9, that is, 70 < 2 — p, 

r - 1 /2(3zE\^£le'\">\^^ , if /3 = 2 - p, that is, 70 > 2 - 

and z is chosen small enough such that k > 0. To conclude, note that if 
\g(x)\ < M2, then PV{x) < V{MiMV{eo)]. Choose Mi such that if |x| > 
Ml , then (t>{V{x)) > V{Mi)E[V{£o)]. Then, defining C = { |x| > Mi }, we have 
that, for all x^C, PV{x) - Vix) < (p{V{x)). 

(ii) We now consider the case /c = 70 [observe that /3 := 70 A (2 — p) = 70 
and that many results above remain valid]. By (3.13), (3.14), (3.16) and 

(3.19) , we have for large |x| and large |(7(x)|, 

PV(x) - V(x) ^ 1 2 2 2 

— — < -zrjo + r 70 

Vix) 2 

+ ^ z V 1^ I '^°"'E[eo^(eo)] ( 1 + o( 1 )) . 

For z small enough, the term on the right-hand side is in the interval (—1,0) 
and this shows that the Foster-Lyapunov drift condition (1.2) holds with 
C of the form {x, \g(x)\ < Mi} U [x, \x\ < M2} for large enough Mi, M2. 
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(iii) We finally consider the case p < 79. Using the inequality (1 — u)"'" < 
1 — 7ott for all < ti < 1, we have for |x| > Rq, \g{x)\'^° < \x\'^° — -yor\x\'^°~P . 
Hence, since V{u + w) < V(u)V{w), this yields, for \x\ > Rq, 

PV{x) = E[V{g{x) + eo)] 

< V{g{xW[V{eo)] 

< ^-'nozo\x\^o-p-^ 

Hence lim^^^^^PV{x)/V{x) = 0, which implies that the Foster-Lyapunov 
drift condition (1.2) holds with C := {\x\ < M} for large enough M. □ 

3.4. Stochastic unit root. We now consider a process which belongs to the 
wide family of stochastic unit root models. See, for example, Granger and Sawnson 
(1997) for many examples. The model we consider is one of the simplest. It 
has been considered in Gourieroux and Robert (2001) with main focus on 
its extremal behavior: 

(3-20) <I>n+l = l{C/„+,<g(<J>„)}<I>n. + £n+l, 

where {en,'^ € N) is a sequence of i.i.d. random variables that satisfies (3.11) 
and {Un, n > 1) is a sequence of i.i.d. random variables, uniformly distributed 
on [0, 1] and independent from the sequence {en,n G N). Moreover, we make 
the following assumption on g. 

Assumption 3.5. g \s a, continuous function with values in [0,1) and 
there exist k e (0, 1), c+C^) > 0, c^{g) < 1 and i?o > such that 

(3.21) yx>Ro 1 - gix) > c+{g)x-'^, 

(3.22) yx<Ro g{x)<c_{g). 

Let P be the transition kernel of the chain. For all x G M and all Borel 
sets A, it can be expressed as 

(3.23) P{x,A) = g{x)¥{x + eo^A) + {l- g{x)) P(eo G A) . 

Under Assumption 3.5, for all M > 0, there exists a constant r]{M) such 
that, for all a; < M and all Borel sets A, 

(3 . 24) P{x,A)> i]{M)F{eo G A) . 

This means that every set of the form (— oo,M] is petite. Define x+ = 
max(x,0). 

Theorem 3.5. Under Assumption 3.5 and if Eq satisfies (3.11), there 
exist z €{0,zq], 5 > and M > Rq such that the drift condition 'D((j),V, C) 

holds with V{x) = 6^^"^+ , (piv) = 5z''/l^v{ 1 V log(t;)}-^/^, C = (-00, M] and (3 
and T are given according to the value o/E[eo] by: 



V{x). 
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(i) /9 = 7o A (1 — k) and t = k, if E[eo] > 0; 

(ii) /? = 70 A (1 - k/2) T = K,if E[eo] = 0; 

(iii) /9 = 7o and r = (1 — 70) A k, if E[eo] < 0. 

Corollary 3.6. Under the same assumptions, the chain is il^ -irreducible 
and (strongly) aperiodic and there exist c> and z > such that any prob- 
ability measure A on M.'^ satisfying X(V) < 00 is {f,r)-regular with: 

(i) ,(^) = e-'^o-<-^..A.o-(.-+- = ^/E[eo]>0; 

(ii) ,(„)^gcn<^oA(.-./w.-./2,+., ^/E[eo]=0; 

(iii) r(n) = ecn^o/(«A(i-,o,+,o, ^^^^ ^ ^,,^0^ ^^^^^^^ ^ ^ 

Proof of Theorem 3.5. Let z < zq and x > 0. Using the definition of 
the transition kernel P, we have 

PV{x) - V{x) = g{x)E[V{x + Eq)] + (1 - g{x))E[V{£o)] - V{x) 

= g(x){E[V{x + eo)] - V{x)) - (1 - g{x)) {V{x) - E[V{eo)]) 
< E[V{x + eo)] - V{x) - ( 1 - g{x)) {V{x) - E[V{eo)]) . 

Define R{x,eo) = V{x + eo) — V{x) - eoPzx^~^V{x). For any t] G (0, 1), we 
can write 

E[Vix + eo)] - V{x) - pzEleolxf^'^Vix) 

= E[R{x,eo)ll\eo\<r,x}] +'^[Rix,£o)l{\eo\>vx}]- 

By the same arguments as in the proof of Theorem 3.3, we have 

(3 25) ^[^(^'^o)l{|.o|>^x)] 

<E[V{(l+rj"')\£o\)+V(\eo\) + [5zr,'-f'\Eo\f^V(\Eo\)]. 

Thus this term is bounded provided that rj and z are chosen such that 
< zo- To bound the second term, note that for large enough x, 
the function x x^^~^V{x) is increasing. Thus, for x > M, for some M 
depending on r], and |eo| < t]x, there exists t S (0, 1) such that 

V{x + eo) - V{x) - f3zeox'^-^V{x) 

= \(3{I3 - l)z{x + teQ)l^-^elV{x + teo) 

+ \ {f3z{x + teo)^" ^felV{x + teo) 

< ^f3h\\+v)^''-^x^''-^elV{x)V{\eo\) 

< \(3^z'x'P-^elV{x)V{\eo\). 
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For c < c+ig) and x large enough, say x> M for some M > Rq, we have 

(1 - g{x)) {V{x) - E[V{eo)]) > cx-^V{x). 

Hence, taking (3.25) into account, there exists a positive real number M 
such that, if x> M, then 

PV{x) - V{x) 

< {z/3x^"^E[eo] + \(3^z^x^f^-^E[£lV{\eo\)] - cx-^)V{x). 

If E[eo] > 0, set /? = 70 A (1 — k). Then, for large enough x, we obtain 

PV{x) - V{x) < -dx-^Vix) 

= -(52'^/^y(x){log(y(x))}-^/^, 

with 5 = c< c^ig) if 70 < I — k or 6 = c — (3zE,[eo], c < Cj^{g) and z such that 
5 > if 70 > 1 — K. If E[eo] < 0, set (3 = and r = (1 — 70) A k. Then, for x 
large enough, 

PV{x) - V{x) < -fe-^F(x) 

= -6z^/^''Vix)[logiV(x))}-^/^°, 

with 5 = c < c+{g) if 70 < 1 — k and 6 = c — 2;/?E[eo], c < c+(g) and z such 
that (5 > if 7o > 1 — K. If IE[eo] = Oj then /? must satisfy 2/3 — 2 < — k; thus 
we set /3 = (1 — k/2) A 70, and we obtain 

PV{x) - V(x) < -6x-^V{x) 

= -6z''/^Vix)[logiVix))}~^/'^, 

with 6 = c< c+{g) if 1 - k/2 > 70 and 6 = c- \P^z^E[elV{\eo\)], with c < c+{g) 
and z such that (5 > if 1 — k/2 < 70. □ 
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